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ABSTRACT 



A flexible, policy-based, mechanism for managing, 
monitoring, and prioritizing traffic within a network and 
allocating bandwidth to achieve true quality of service 
(QoS) is provided. According to one aspect of the present 
invention, a method is provided for managing bandwidth 
allocation in a network that employs a non-deterministic 
access protocol, such as an Ethernet network. A packet 
forwarding device receives information indicative of a set of 
traffic groups, such as: a MAC address, or IEEE 802. lp 
priority indicator or 802.1 Q frame tag, if the QoS policy is 
based upon individual station applications; or a physical port 
if the QoS policy is based purely upon topology. The packet 
forwarding device additionally receives bandwidth param- 
eters corresponding to the traffic groups. After receiving a 
packet associated with one of the traffic groups on a first 
port, the packet forwarding device schedules the packet for 
transmission from a second port based upon bandwidth 
parameters corresponding to the traffic group with which the 
packet is associated. According to another aspect of the 
present invention, a method is provided for managing band- 
width allocation in a packet forwarding device. The packet 
forwarding device receives information indicative of a set of 
traffic groups. The packet forwarding device additionally 
receives information defining a QoS policy for the traffic 
groups. After a packet is received by the packet forwarding 
device, a traffic group with which the packet is associated is 
identified. Subsequently, rather than relying on an end-to- 
end signaling protocol for scheduling, the packet is sched- 
uled for transmission based upon the QoS policy for the 
identified traffic group. 

28 Claims, 5 Drawing Sheets 
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POLICY BASED QUALITY OF SERVICE 

This is a continuation of application Ser. No. 09/018,103, 
filed on Feb. 3, 1998, now issued U.S. Pat. No. 6,104,700. 

This application claims the benefit of U.S. Provisional S 
Application No. 60/057,371, filed Aug. 29, 1997. 

COPYRIGHT NOTICE 

Contained herein is material that is subject to copyright JQ 
protection. The copyright owner has no objection to the 
facsimile reproduction of the patent disclosure by any per- 
son as it appears in the Patent and Trademark Office patent 
files or records, but otherwise reserves all rights to the 
copyright whatsoever. ^ 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The invention relates generally to the field of computer 
networking devices. More particularly, the invention relates 20 
to a flexible, policy-based mechanism for managing, 
monitoring, and prioritizing traffic within a network and 
allocating bandwidth to achieve true Quality of Service 
(QoS). 

2, Description of the Related Art 25 
Network traffic today is more diverse and bandwidth- 
intensive than ever before. Today's intranets are expected to 
support interactive multimedia, full-motion video, rich 
graphic images and digital photography. Expectations about 3Q 
the quality and timely presentation of information received 
from networks is higher than ever. Increased network speed 
and bandwidth alone will not satisfy the high demands of 
today's intranets. 

The Internet Engineering Task Force (IETF) is working 35 
on a draft standard for the Resource Reservation Protocol 
(RSVP), an Internet Protocol-(IP) based protocol that allows 
end-stations, such as desktop computers, to request and 
reserve resources within and across networks. Essentially, 
RSVP is an end-to-end protocol that defines a means of 40 
communicating the desired Quality of Service between 
routers. RSVP is receiver initiated. The end-station that is 
receiving the data stream communicates its requirements to 
an adjacent router and those requirements are passed back to 
all intervening routers between the receiving end-station and 45 
the source of the data stream and finally to the source of the 
data stream itself. Therefore, it should be apparent that 
RSVP must be implemented across the whole network. That 
is, both end-stations (e.g., the source and destination of the 
data stream) and every router in between should be RSVP 50 
compliant in order to accommodate the receiving end- 
station's request. 

While RSVP allows applications to obtain some degree of 
guaranteed performance, it is a first-come, first-served 
protocol, which means if there are no other controls within 55 
the network, an application using RSVP may reserve and 
consume resources that could be needed or more effectively 
utilized by some other mission-critical application. A further 
limitation of this approach to resource allocation is the fact 
that end-stations and routers must be altered to be RSVP 6Q 
compliant. Finally, RSVP lacks adequate policy mechanisms 
for allowing differentiation between various traffic flows. It 
should be appreciated that without a policy system in place, 
the network manager loses control. 

Recent attempts to facilitate traffic differentiation and 65 
prioritization include draft standards specified by the Insti- 
tute of Electrical and Electronics Engineers (IEEE). The 
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IEEE 802.1 Q draft standard provides a packet format for an 
application to specify which Virtual Local Area Network 
(VLAN) a packet belongs to and the priority of the packet. 
The IEEE 802. lp committee provides a guideline to classify 
traffic based on a priority indicator in an 802.1 Q frame tag. 
This allows VLANs to be grouped into eight different traffic 
classes or priorities. The IEEE 802. lp committee does not, 
however, define the mechanism to service these traffic 
classes. 

What is needed is a way to provide true Quality of Service 
("QoS") in a network employing a non-deterministic access 
protocol, such as an Ethernet network, that not only has the 
ability to prioritize and service different traffic classes, but 
additionally provides bandwidth management and guaran- 
tees a quantifiable measure of service for packets associated 
with a particular traffic class. More specifically, with respect 
to bandwidth management, it is desirable to employ a 
weighted fair queuing delivery schedule which shares avail- 
able bandwidth so that high priority traffic is usually sent 
first, but tow priority traffic is still guaranteed an acceptable 
minimum bandwidth allocation. Also, it is desirable to 
centralize the control over bandwidth allocation and traffic 
priority to allow for QoS without having to upgrade or alter 
end-stations and existing routers as is typically required by 
end-to-end protocol solutions. Further, it would be advan- 
tageous to put the control in the hands of network managers 
by performing bandwidth allocation and traffic prioritization 
based upon a set of manager-defined administrative policies. 
Finally, since there are many levels of control a network 
manager may elect to administer, it is desirable to provide a 
variety of scheduling mechanisms based upon a core set of 
QoS profile attributes. 

BRIEF SUMMARY OF THE INVENTION 
A flexible, policy-based, mechanism for managing, 
monitoring, and prioritizing traffic within a network and 
allocating bandwidth to achieve true Quality of Service 
(QoS) is described. According to one aspect of the present 
invention, a method is provided for managing bandwidth 
allocation in a network that employs a non-determninistic 
access protocol. A packet forwarding device receives infor- 
mation indicative of a set of traffic groups. The packet 
forwarding device additionally receives parameters, such as 
bandwidth and priority parameters, corresponding to the 
traffic groups. After receiving a packet associated with one 
of the traffic groups on a first port, the packet forwarding 
device schedules the packet for transmission from a second 
port based upon parameters corresponding to the traffic 
group with which the packet is associated. Advantageously, 
in this manner, a weighted fair queuing schedule that shares 
bandwidth according to some set of rules may be achieved. 

According to another aspect of the present invention, a 
method is provided for managing bandwidth allocation and 
traffic prioritization in a packet forwarding device. The 
packet forwarding device receives information indicative of 
a set of traic groups. The packet forwarding device addi- 
tionally receives information defining a Quality of Service 
(QoS) policy for the traffic groups. After a packet is received 
by the packet forwarding device, a traffic group with which 
the packet is associated is identified. Subsequently, rather 
than relying on an end-to-end signaling protocol for 
scheduling, the packet is scheduled for transmission based 
upon the QoS policy for the identified traffic group. 
Therefore, bandwidth allocation and traffic prioritization are 
based upon a set of administrative policies over which the 
network manager retains control. 

According to yet another aspect of the present invention, 
a number of QoS queues are provided at each port of the 
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packet forwarding device. A current bandwidth metric is in machine-executable instructions, which may be used to 

determined for each of the QoS queues for a particular port. cause a general-purpose or special-purpose processor pro- 

The QoS queues are divided into two groups based upon gramme d with the instructions to perform the steps. 

their respective bandwidth metrics and their respective mini- Alternatively, the steps may be performed by a combination 

mum bandwidth requirements. Subsequently, the groups are 5 of hardware and software. While, embodiments of the 

used as a first level arbitration mechanism to select a QoS present invention will be described with reference to a high 

queue that will source the next packet. speed Ethernet switch, the method and apparatus described 

Other features of the present invention will be apparent herein are equally applicable to other types of network 

from the accompanying drawings and from the detailed devices or packet forwarding devices. 

description which follows. 10 . _ 1 _ . . . , . ^ 

An Exemplary Switch Architecture 

BRIEF DESCRIPTION OF THE SEVERAL An overview of the architecture of a switch 100 in which 

VIEWS OF THE DRAWINGS one embodiment of the present invention may be imple- 

The present invention is illustrated by way of example, mented is illustrated by FIG. LA The central memory 

and not by way of limitation, in the figures of the accom- 15 architecture depicted includes multiple ports 105 and 110 

panying drawings and in which like reference numerals refer ca °h coupled via a channel to a filtering/forwarding engine 

to similar elements and in which: 11S * coupled to the filtering/forwarding engine 115 is 

FIG. 1A is a simplified block diagram of an exemplary * forWM ^ ^ } 2 °A * ^ R ™^™ 
switch architecture in which one embodiment of the present M Memory (RAM) 125, and a Central Processing Unit (CPU) 

invention may be implemented. 

FIG. IB is a logical view of the interaction between According to one embodiment each channel is capable of 

switch processing blocks according to one embodiment of ™PP°rt«>g a data transfer rate of one gigabit per second in 

the present invention. f ransmlt dlrectlon and one gigabit per second m the 

_„ „ . „ .. ... receive direction, thereby providing 2 Gb/s full-duplex capa- 

FIG. 2 is a flow diagram illustrating high level bandwidth 25 m chaQnel ^^^^ ^ channels 5e mnm 

management and traffic prioritization processing according fi d tQ 0Qe Gi bit Ethemet netwofk connection 

to one embodiment of the present invention. Qr eigfat Fast £thernet netWQrk COIinections . 

FIG. 3 is a flow diagram illustrating periodic evaluation of ^ filtering/forwarding engine 115 includes an address 

QoS categories according to one embodiment of the present ^ fiUer (nQt shown)j a matrix (nQt shown)? ^ a buffer 

invention. manager (not shown). The address filter may provide 

FIG. 4 is a flow diagram illustrating next packet sched- bridging, routing, Virtual Local Area Network (VLAN) 

uling according to one embodiment of the present invention. tagging functions, and traffic classification. The switch 

matrix connects each channel to a central memory such as 
packet RAM 125. The buffer manager controls data buffers 
and packet queue structures and controls and coordinates 
A flexible, policy-based, mechanism for managing, accesses to and from the packet RAM 125. 
monitoring, and prioritizing traffic within a network and The forwarding database 120 may store information use- 
allocating bandwidth to achieve true Quality of Service fyj for making forwarding decisions, such as layer 2 (e.g., 
(QoS) is described. "Quality of Service" in this context 40 Media Access Control (MAC) layer), layer 3 (e.g., Network 
essentially means that there is a quantifiable measure of the layer), and/or layer 4 (e.g., Transport layer) forwarding 
service being provided. The measure of service being pro- information, among other things. The switch 100 forwards a 
vided may be in terms of a packet loss rate, a maximum packet received at an input port to an output port by 
delay, a committed minimum bandwidth, or a limited maxi- performing a search on the forwarding database using 
mum bandwidth, for example. 45 address information contained within the header of the 
In the present invention, a number of QoS queues may be received packet. If a matching entry is found, a forwarding 
provided at each port of a packet forwarding device, such as decision is constructed that indicates to which output port 
a Local Area Network (LAN) switch. Based upon a set of the received packet should be forwarded, if any. Otherwise, 
QoS parameters, various types of traffic can be distinguished the packet is forwarded to the CPU 130 for assistance in 
and associated with particular QoS queues. For example, 50 constructing a forwarding decision. 

packets associated with a first traffic group may be placed The packet RAM 125 provides buffering for packets and 

onto a first QoS queue and packets associated with another acts as an elasticity buffer for adapting between incoming 

traffic group may be placed onto a second QoS queue. When and outgoing bandwidth differences. Packet buffering is 

a port is ready to transmit the next packet, a scheduling discussed further below, 

mechanism may be employed to select which QoS queue of 5S 

the QoS queues associated with the port will provide the Logical View of Exemplary Switch Processing 

next packet for transmission. FIG. IB is a logical view of the interaction between 

In the following description, for the purposes of exemplary switch processing blocks that may be distributed 

explanation, numerous specific details are set forth in order throughout the switch 100. For example, some of the pro- 

to provide a thorough understanding of the present inven- to cessing may be performed by functional units within the 

tion. It will be apparent, however, to one skilled in the art ports of the switch and other processing may be performed 

that the present invention may be practiced without some of by the CPU 130 or by the address filter/switch matrix/buffer 

these specific details. In other instances, well-known struc- manager 115. In any event, the processing can be concep- 

tures and devices are shown in block diagram form. tually divided into a first group of functions 160 dedicated 

The present invention includes various steps, which will 65 to input processing and a second group of functions 185 

be described below. The steps of the present invention may dedicated to output processing. According to the present 

be performed by hardware components or may be embodied embodiment, the first group 160 includes a comparison 
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engine 155, an enqueue block 161, a packet classification 
block 150, and a buffer manager 165. The second group 185 
includes a dequeue block 162, a Quality of Service (QoS) 
category evaluation block 175, and a scheduler 170. 

Additionally, a user interface (UI) 145 may be provided 
for receiving various parameters from the network manager. 
The UI may be text based or graphical. In one embodiment, 
the UI 145 may include an in-band HyperText Markup 
Language (HTML) browser-based management tool which 
may be accessed by any standard web browser. In any event, 
the goal of the UI 145 is to separate high-level policy 
components, such as traffic grouping and QoS profiles from 
the details of the internal switch hardware. Thus, user 
configuration time is minimized and a consistent interface is 
provided to the user. 

The UI 145 receives information indicative of one or 
more traffic groups. This information may be provided by 
the network manager. There are several ways to define a 
traffic group. Table 1 below illustrates a variety of traffic 
classification schemes that may be supported by the UI 145. 

TABLE 1 



Policy Based Upon 



Traffic Classification 

Traffic Group Definition OS I Layer 



Applications 



Network Layer 
Topology or Groups of 
Users 

End- Station Applications 
Physical Topology 



TCP Session 
UDP Session 
RSVP Flow 

Network Layer Protocol 
Subnet or IP Address 
VLAN Identifier 
MAC Address 
802.1p or 802.1Q 
Physical Port 



Transport Layer 

Network Layer 

Link Layer 
Physical Layer 



10 



15 



25 



30 



35 



The information used to identify a traffic group typically 
depends upon what terms the QoS policy is defined. If the 
QoS policy is based on applications, traffic groups may be 
differentiated at the Transport layer by Transmission Control 
Protocol (TCP) session or User Datagram Protocol (UDP) 40 
session. For example, the network manager may provide 
information indicative of TCP source and destination ports 
and IP source and destination addresses to identify traffic 
groups. However, if the QoS policy is based upon the 
Network layer topology or groups of users, traffic group 45 
definition may be more convenient by supplying informa- 
tion regarding the Network layer protocol, such as Internet 
Protocol (IP) or Internetwork Packet Exchange (I px )» toe 
subnet or IP addresses, or VLAN identifiers. If the QoS 
policy is defined by end-station applications, then Media 50 
Access Control (MAC) addresses, IEEE 802. Ip priority 
indications, or IEEE 802.1 Q frames may be employed to 
identify traffic groups. Finally, if the QoS policy is physical 
topology based, physical port identifiers may be used to 
differentiate traffic groups. 55 

It should be noted that Table 1 merely presents an 
exemplary set of traffic group identification mechanisms. 
From the examples presented herein, additional, alternative, 
and equivalent traffic grouping schemes and policy consid- 
erations will be apparent to those of ordinary skill in the art. <jo 
For example, other state information may be useful for 
purposes of packet classification, such as the history of 
previous packets, the previous traffic load, the time of day, 
etc. 

It is appreciated that traffic classifications based upon the 65 
traffic group definitions listed above may result in overlap. 
Should the network manager define overlapping traffic 



groups, the UI 145 may issue an error message and reject the 
most recent traffic group definition, the UI 145 may issue a 
warning message to the network manager and allow the 
more specific traffic group definition to override a conflict- 
ing general traffic group definition, or the UI 145 may be 
configured to respond in another manner. 

A number of QoS queues 180 may be provided at each of 
the ports of a packet forwarding device. In one embodiment, 
a mapping of traffic groups to QoS queues 180 may be 
maintained. As traffic groups are provided by the network 
manager, the UI 145 updates the local mapping of traffic 
groups to QoS queues 180. This mapping process may be a 
one-to-one mapping of the traffic groups defined by the 
network manager to the QoS queues 180 or the mapping 
process may be more involved. For example, there may be 
more traffic groups than QoS queues 180, in which case, 
more than one traffic group will be mapped to a single QoS 
queue. Some consolidation rules for combining multiple 
traffic groups into a single QoS queue will be discussed 
below. 

At any rate, by providing a layer of abstraction in this 
manner, the network manager need not be burdened with the 
underlying implementation details, such as the number of 
QoS queues per port and other queuing parameters. Another 
advantage achieved by this layer of abstraction between the 
traffic group definitions and the physical QoS queues is the 
fact that the UI 145 is now decoupled from the underlying 
implementation. Therefore, the UI 145 need not be updated 
if the hardware QoS implementation changes. For example, 
software providing for traffic group definition need not be 
changed simply because the number of QoS queues per port 
provided by the hardware changes. 

The input data stream is received by the comparison 
engine 155 from input switch ports (not shown). Under the 
direction of the packet classification process 150, the com- 
parison engine 155 determines with which of the previously 
defined traffic groups a packet in the data stream is associ- 
ated. The packet classification block 150 may employ the 
traffic group indications provided by the network manager to 
provide the comparison engine 155 with information regard- 
ing locations and fields to be compared or ignored within the 
header of a received packet, for example. It should be 
appreciated if the comparison required for traffic classifica- 
tion is straightforward, such as in a conventional packet 
forwarding device, then the comparison engine 155 and the 
packet classification block 150 may be combined. 

The packet classification block 150 in conjunction with 
the UI 145 provide a network manager with a flexible 
mechanism to control traffic prioritization and bandwidth 
allocation through the switch 100. Importantly, no end-to- 
end signaling protocol needs to be implemented by the 
network devices. For example, the end-station that is to 
receive the data stream need not reserve bandwidth on each 
of the intermediate devices between it and the source of the 
data stream. Rather, a packet forwarding device employing 
the present invention can provide some benefit to the net- 
work without requiring routers and/or end-stations to do 
anything in particular to identify traffic. Thus, traffic priority 
may be enforced by the switch 100 and QoS may be 
delivered to applications without altering routers or end- 
stations. 

According to one embodiment, the buffer manager 165 
participates in policy based QoS by controlling the alloca- 
tion of buffers within the packet RAM 125. Buffers may be 
dynamically allocated to QoS queues 180 as needed, within 
constraints established by QoS profile attributes, which are 
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discussed below. The buffer manager 165 may maintain 
several programmable variables for each QoS queue. For 
example, a Minimum Buffer Allocation and a Maximum 
Queue Depth may be provided for each QoS queue. The 
Minimum Buffer Allocation essentially reserves some mini- 5 
mum number of buffers in the packet RAM 125 for the QoS 
queue with which it is associated. The Maximum Queue 
Depth establishes the maximum number of buffers that can 
be placed on a given QoS queue. The buffer manager 165 
also maintains a Current Queue Depth for each QoS queue 
to assure the maximum depth is not exceeded. For example, 
before allowing a buffer to be added to a given QoS queue, 
the buffer manager 165 may compare the Maximum Queue 
Depth to the Current Queue Depth to ensure the Maximum 
Queue Depth is not exceeded. 

Variables are also maintained for tracking free buffers in 15 
the packet RAM 125. At initialization, a Buffers Free Count 
contains the total number of buffers available in the packet 
RAM 125 and a Buffers Reserved Count contains the sum of 
the minimum buffer allocations for the QoS queues 180. As 
packets are received they are stored in free buffers, and the 20 
Buffers Free Count is decremented by the number of buffers 
used for such storage. After the appropriate QoS queue has 
been identified the buffer manager 165 instructs the enqueue 
block 161 to add the packet to the QoS queue. The enqueue 
block 161 links the packet to the identified queue provided 25 
that the Current Queue Depth is less than the Maximum 
Queue Depth and either (1) the Current Queue Depth is less 
than the Minimum Buffer Allocation or (2) the Buffers 
Reserved Count is less than the Buffers Free Count. 
Therefore, if a QoS queue exceeds its reserve of buffers 2Q 
(e.g., Minimum Buffer Allocation), to the extent that addi- 
tional buffers remain free, the QoS queue may continue to 
grow. Otherwise, the enqueue block 161 will discard the 
packet, the buffers are returned to the free pool, and the 
Buffers Free Count is increased by the number of buffers that 
would have been consumed by the packet. When a packet is 3S 
successfully linked to a QoS queue, the Current Queue 
Depth for that QoS queue is increased by the number of 
buffers used by the packet. If, prior to the addition of the 
packet to the queue, the Current Queue Depth was less than 
the Minimum Buffer Allocation then the Buffers Reserved 40 
Count is decreased by the lesser of (1) the number of buffers 
in the packet or (2) the difference between the Current 
Queue Depth and the Minimum Buffer Allocation. 

The QoS category evaluation process 175 separates the 
QoS queues into a plurality of categories based upon a set of 45 
bandwidth parameters. The scheduler 170 uses the grouping 
provided by the QoS category evaluation process 175 to 
select an appropriate QoS queue for sourcing the next packet 
for a particular port. The evaluation of QoS queue categories 
may be performed periodically or upon command by the 5Q 
scheduler 170, for example. Periodic evaluation of QoS 
categories and scheduling is discussed in further detail 
below. 

Responsive to the scheduler 170 the dequeue block 162 
retrieves a packet from a specified QoS queue. After the 
packet has been transmitted, the buffer variables are 
updated. The Buffers Free Count is increased and the 
Current Queue Depth is decreased by the number of buffers 
utilized to store the packet. If the resulting Current Queue 
Depth is less than the Minimum Buffer Allocation, then the 
Buffers Reserved Count is increased by the lesser of the 60 
number of buffers utilized to store the packet or the differ- 
ence between the Current Queue Depth and the Minimum 
Buffer Allocation. 

QoS Profile Attributes 65 

Setting QoS policy is a combination of identifying traffic 
groups and defining QoS profiles for those traffic groups. 
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According to one embodiment, each individual traffic group 
may be associated with a QoS profile. However, in alterna- 
tive embodiments, multiple traffic groups may share a com- 
mon QoS profile. Having described traffic group classifica- 
tion and identification above, QoS profile attributes (also 
referred to as parameters) will now be discussed. 

Several queuing mechanisms may be implemented using 
one or more of the following parameters associated with a 
traffic group: (1) minimum bandwidth, (2) maximum 
bandwidth, (3) peak bandwidth, (4) maximum delay, and (5) 
relative priority. In general, the minimum, maximum, and 
peak bandwidth parameter may be expressed in Mbps, a 
percentage of total bandwidth, or any other convenient 
representation. 

Minimum bandwidth indicates the minimum amount of 
bandwidth a particular traffic group needs to be provided 
over a defined time period. If the sum of the minimum 
bandwidths for all traflic groups defined is less than 100% of 
the available bandwidth, then the scheduling processing, 
discussed below, can assure that each traffic group will 
receive at least the minimum bandwidth requested. 

Maximum bandwidth is the maximum sustained band- 
width the traffic group can realize over a defined time period. 
In contrast, peak bandwidth represents the bandwidth a 
traffic group may utilize during a particular time interval in 
excess of the maximum bandwidth. The peak bandwidth 
parameter may be used to limit traffic bursts for the traffic 
group with which it is associated. The peak bandwidth also 
determines how quickly the traffic group's current band- 
width will converge to the maximum bandwidth. By pro- 
viding a peak bandwidth value that is much higher than the 
maximum bandwidth, if sufficient bandwidth is available, 
the maximum bandwidth will be achieved relatively quickly. 
In contrast, a peak bandwidth that is only slightly higher that 
the maximum bandwidth will cause the convergence to the 
maximum bandwidth to be more gradual. 

Maximum delay specifies a time period beyond which 
further delay cannot be tolerated for the particular traffic 
group. Packets comprising the traffic group that are for- 
warded by the switch 100 are guaranteed not to be delayed 
by more than the maximum delay specified. 

Relative priority defines the relative importance of a 
particular traffic group with respect to other traffic groups. 
As will be discussed further below, within the same QoS 
category, traffic groups with a higher priority are preferred 
over those with lower priorities. 

This small set of parameters in combination with the 
variety of traffic classification schemes gives a network 
manager enormous control and flexibility in prioritizing and 
managing traffic flowing through packet forwarding devices 
in a network. For example, the QoS profile of a video traffic 
group, identified by UDP session, might be defined to have 
a high priority and a minimum bandwidth of 5 Mbps, while 
the QoS profile of an engineering traffic group, identified by 
VLAN, may be set to a second priority, a minimum band- 
width of 30 Mbps, a maximum bandwidth of 50 Mbps, and 
a peak bandwidth of 60 Mbps. Concurrently, the QoS profile 
of a World Wide Web (WWW) traffic group, identified by 
protocol (e.g., IP), may be set to have a low priority, a 
minimum bandwidth of 0 Mbps, a maximum bandwidth of 
100%, and a peak bandwidth of 100%. 

Consolidation Rules 

It was mentioned earlier that multiple traffic groups may 
be mapped to a single QoS queue. This may be accom- 
plished by maintaining an independent set of variables (e.g., 
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minimum bandwidth, maximum bandwidth, peak 
bandwidth, maximum delay, and relative priority) for each 
QoS queue in addition to those already associated with each 
traffic group and following the general consolidation rules 
outlined below. 

Briefly, when the mapping from traffic groups to QoS 
queues is one-to-one, the determination of a particular QoS 
queues' attributes is straightforward. The QoS queue's 
attributes simply equal the traffic group's attributes. 
However, when combining multiple traffic groups that do 
not share a common QoS profile onto a single QoS queue, 
the following general consolidation rules are suggested: (1) 
add minimum attributes of the traffic groups being combined 
to arrive at an appropriate minimum attribute for the target 
QoS queue (e.g., the QoS queue in which the traffic will be 
merged), (2) use the largest of maximum attributes to arrive 
at an appropriate value for a maximum attribute for the 
target QoS queue, and (3) avoid merging traffic groups that 
have different relative priorities. This last rule suggests the 
number of priority levels provided should be less than or 
equal to the number of QoS queues supported by the 
implementation to assure traffic groups with different pri- 
orities are not combined in the same QoS queue. 

Importantly, when a network manager has determined that 
multiple traffic groups will share a common QoS profile, the 
consolidation rules need not apply, as the network manager 
has already, in effect, manually consolidated the parameters. 

Bandwidth Management and Traffic Prioritization 

Having described an exemplary environment in which 
one embodiment of the present invention may be 
implemented, bandwidth management and traffic prioritiza- 
tion will now be described with reference to FIG. 2. FIG. 2 
is a flow diagram illustrating the high level bandwidth 
management and traffic prioritization processing according 
to one embodiment of the present invention. In this 
embodiment, at step 210, a manager-defined QoS policy 
may be received via the UI 145, for example. The QoS 
policy is a combination of traffic groups and QoS profile 
attributes corresponding to those traffic groups. 

At step 220, a packet is received by the switch 100. Before 
the packet can be placed onto a QoS queue for transmission, 
the traffic group to which the packet belongs is identified at 
step 230. Typically, information in the packet header, for 
example, can be compared to the traffic group criteria 
established by the network manager to identify the traffic 
group to which the packet belongs. This comparison or 
matching process may be achieved by programming filters 
in the switch 100 that allow classification of traffic. Accord- 
ing to one embodiment, the packet may be identified using 
the traffic group definitions listed in Table 1 . 

At step 250, enqueue processing is performed. The packet 
is added to the rear of the appropriate QoS queue for the 
identified traffic group. Importantly, if a maximum delay has 
been assigned to the traffic group with which the packet is 
associated, then the packet should either be dropped or 
transmitted within the period specified. According to one 
embodiment, this may be accomplished by limiting the 
depth (also referred to as length) of the corresponding QoS 
queue. Given the minimum bandwidth of the QoS queue and 
the maximum delay the traffic group can withstand, a 
maximum depth for the QoS queue can be calculated. If the 
QoS queue length remains less than or equal to the maxi- 
mum length, then the packet is added to the QoS queue. 
However, if the QoS queue length would exceed the maxi- 
mum length by the addition, then the packet is dropped. 
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At step 260, scheduling is performed. The scheduling/ 
dequeuing processing involves determining the appropriate 
QoS queue group, selecting the appropriate QoS queue 
within that QoS queue group, and removing the packet at the 
S front of the selected QoS queue. This selected packet will be 
the next packet the port transmits. Scheduling will be 
discussed further below. 

Evaluation of QoS Categories 

10 According to one embodiment of the present invention, it 
is advantageous to divide the QoS queues into at least two 
categories. The categories may be defined based upon the 
maximum bandwidth, the minimum bandwidth, the peak 
bandwidth, and the "current bandwidth." The current band- 

15 width should not be mistaken for a bandwidth at an instant 
in time, rather the current bandwidth is a moving average 
that is updated periodically upon the expiration of a prede- 
termined time period. Empirical data suggests this predeter- 
mined time period should be on the order of ten packet 

20 times, wherein a packet time is the time required to transmit 
a packet. However, depending upon the environment and the 
nature of the traffic, a value in the range of one to one 
hundred packet times may be more suitable. 

25 The members of the first category ("Category A") are 
those QoS queues which have a current bandwidth that is 
below their peak bandwidth and below their minimum 
bandwidth. Members of the second category ("Category B") 
include those QoS queues that have a current bandwidth that 

3Q is greater than or equal to their minimum bandwidth, but less 
than both their maximum bandwidth and their peak band- 
width. The remaining QoS queues (e.g., those having a 
current bandwidth that is greater than or equal to either the 
peak bandwidth or the maximum bandwidth) are ineligible 

35 for transmission. These QoS queues that are ineligible for 
transmission can be considered a third category ("Category 
C"). With this overview of QoS categories, an exemplary 
process for periodic evaluation of QoS categories will now 
be described. 

40 FIG. 3 is a flow diagram illustrating periodic evaluation of 
QoS categories according to one embodiment of the present 
invention. In this embodiment, at step 310, processing loops 
until the predetermined evaluation time period has expired. 
For example, a test may be performed to determine if the 

45 current time is greater than or equal to the last evaluation 
time plus the predetermined evaluation time interval. 
Alternatively, the evaluation process may be triggered by an 
interrupt. In any event, when it is time to evaluate the QoS 
queue categorization, processing continues with step 330. 

50 It will be appreciated that the time interval chosen for the 
predetermined evaluation time period should not be too long 
or too short. If the time interval is too long, one QoS queue 
might be allowed to monopolize the link until its maximum 
bandwidth is achieved while other QoS queues remain idle. 

55 If the time interval is too short, transmitting a single packet 
or remaining idle for a single packet time may cause the QoS 
queue to become a member of a different QoS category (e.g., 
the single transmission may cause the current bandwidth to 
exceed the maximum bandwidth or the single idle time may 

60 cause the current bandwidth to fall below the minimum 
bandwidth) because the moving average moves very quickly 
over short time intervals. 

At step 330, the current bandwidth for a particular QoS 
queue is set to the current bandwidth for that QoS queue as 

65 calculated in the previous time interval multiplied by a first 
weighting factor plus the actual bandwidth that particular 
QoS queue received in the previous time interval multiplied 
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by a second weighting factor, wherein the weighting factors bandwidth and minimum bandwidth (e.g., Category A), 

may be selected to achieve the desired level of responsive- However, if no QoS queues meet this criteria, Category B is 

ness in the current bandwidth metric. For example, it may be selected. 

desirable to have the current bandwidth converge to within At step 430, if multiple QoS queues are members of the 

a certain percentage of a sustained bandwidth if that band- s selected QoS category, processing continues with step 440, 

width has been sustained for a certain amount of time. otherwise, processing branches to step 470. 

Exemplary weighting factors are in the form : (w-l)/w and M ^ ^ Qf ^ QqS are 

l/w,respectxvely. Using weighUng factors of ^y 16 for the first uscd tQ selcct a ^ QoS s of ^ 

weighting factor and a value of V« for the second weighting ^ ^ dm data 

factor, for example, the current bandwidth will reflect 50% 10 Mmn . „ „ 

of a step within 13 time intervals, 80% of a step within 27 At step 450, it two or more QoS queues have the same 

time intervals, and will be within 2% of the sustained P/ 10 ^ '^ Gn processing continues with step 460. Otherwise, 

bandwidth in approximately 63 time intervals (assuming a * a QoS <* ueiie ls L foun u d to have the higjiest relative pnonty, 

maximum and peak bandwidth of 100%). Alternative ratios then Passing branches to step 470. 

and current bandwidth metrics will be apparent to those of is M sle P 460 > the tie ^ resolved by performing round robin 

ordinary skill in the art. or LRU scheduling. That is, until the QoS categories are 

After the current bandwidth has been evaluated for a QoS evaluated, the QoS queues having the same priority will be 

queue, at step 340, the QoS queue bandwidth parameters can r ° tatc ? m a P^termined order or scheduled such 

be compared to the current bandwidth to determine to which ** x 9 oS ^ ueu f lha \ *J as not P r0Vlded a P acket for 

QoS category the QoS queue belongs. As described above, ™ ^mission recently will be given such an opportunity, 

if (CURR_BW<PEAK_BW) and (CURR_BW<MIN„ A&&T ******* a Q° s <l ueue in ttus manner > processing 

BW), then the QoS queue is associated with Category A at continues with step 470. 

step 350. If (CURR_BWIMINJW) and ((CURR__ At step 470, a packet is dequeued from the selected QoS 

BW<MAX_BW) and (CURR_BW<PEAK_BW)), then queue and the packet is transmitted by the port at step 480. 

the QoS queue is associated with Category B at step 360. If 25 This scheduling process may be repeated by looping back to 

(CURR_BW^PEAK_BW) or (CURR_BW^MAX_ step 410, as illustrated. 

BW), then the QoS queue is associated with Category C at „ . „ , 

step 370 Queuing Schemes 

At step 380, if all of the QoS queues have been evaluated, A variety of different queuing mechanisms may be imple- 

then processing branches to step 310; otherwise, processing 30 mented using various combinations of the QoS profile 

continues with step 330. attributes discussed above. Table 2 below illustrates how to 

achieve exemplary queuing mechanisms and corresponding 

Scheduling Processing configurations of the QoS profile attributes. 

Briefly, at each port, three levels of arbitration may be 35 

employed to select the appropriate QoS queue from which to TABLE 2 

transmit the next packet. The first level of arbitration selects Queuing Mechanism Configurations 

among the QoS categories. Category A is given priority if 

any member QoS queues have one or more pending packets. Queuing Mechanism QoS Profile Attribute Value 

Otherwise, a QoS queue with one or more pending packets 40 ^ ^ Minimum Bandwidth = o% 

of Category B is selected. According to one embodiment, the Maximum Bandwidth - ioo% 

relative priority assigned to each QoS queue may be used as Peak Bandwidth - ioo% 

a second level of arbitriation. In this manner, when multiple Maximum Delay - n/a 

QoS qLueLies satisfy the first level arbitration, a higher „ . „ „ t * elative p ™ ri * T I* 10 *^' 

. . rt « : r , . . °; n Round Robin/Least Minimum Bandwidth - 0% 

priority QoS queue IS favored over a lower pnonty QoS 45 Recently Used Queuing Maximum Bandwidth - 100% 

queue. Finally, when there is a tie at the second level of Peak Bandwidth - 100% 

arbitration (e.g., two or more QoS queues in the same QoS Maximum Delay - n/a 

category have the same relative priority), a round robin or w . u , e . „ . S: lfltivc ? i oxit J "f" 1 ^ a11 qucuM> ' 

- , . t r , , , Weighted Fair Queuing Minimum Bandwidth - >0% 

least recently used (LRU) scheme may be employed to Maximum Bandwidth - MAX_BW{ 

select from among the two or more QoS queues until the 50 Peak Bandwidth - PEAK_bw ( 

QoS categories are evaluated. Maximum Delay » n/a 

Assuming a periodic evaluation of QoS categories is 

being performed, the scheduling processing need not include 

such evaluation and the scheduling processing may be PRIORITY, represents a programmable priority value for 

performed as illustrated by FIG. 4, according to one embodi- 55 a particular QoS queue, i. Similarly, MAX_BW f - and 

ment of the present invention. In the embodiment depicted, PEAK_BW / represent programmable maximum band- 

at step 410, processing loops until the port associated with widths and peak bandwidths, respectively, for a particular 

the group of QoS queues being evaluated indicates it is ready QoS queue, i. 

to receive the next packet for transmission. For example, the For a strict priority scheme, each QoS queue's minimum 

port may be polled to determine its transmission status. 60 bandwidth is set to zero percent, each QoS queue's maxi- 

Altematively, the scheduling process may be triggered by an mU m bandwidth is set to one hundred percent, and each QoS 

interrupt. In any event, when the port is ready for the next queue's peak is set to one hundred percent. In this manner, 

packet, processing continues with step 420. me current bandwidth will never be less than the minimum 

At step 420, a QoS category is selected from which a QoS bandwidth, and the current bandwidth will never exceed 

queue will provide the next packet for transmission. As 65 either the peak bandwidth or the maximum bandwidth. In 

described above, priority is given to the category containing this configuration, all QoS queues will be associated with 

QoS queues with pending data that are below the peak Category B since no QoS queues will satisfy the criteria of 
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either Category A or Category B. Ultimately, by configuring 
the QoS profile attributes in this manner, the second level of 
arbitration (e.g., the relative priority of the QoS queues) 
determines which QoS queue is to source the next packet. 

For a pure round robin or least recently used (LRU) s 
scheme, the- QoS profile attributes are as above, but addi- 
tionally all QoS queue priorities are set to the same value. In 
this manner, the third level of arbitration determines which 
QoS queue is to source the next packet. 

Finally, weighted fair queuing can be achieved by 10 
assigning, at least, a value greater than zero percent to the 
desired minimum bandwidth. By assigning a value greater 
than zero to the minimum bandwidth parameter, the particu- 
lar QoS queue is assured to get at least that amount of 
bandwidth on average because the QoS queue will be 15 
associated with Category A until at least its minimum 
bandwidth is satisfied. Additionally, different combinations 
of values may be assigned to the peak and maximum 
bandwidths to prevent a particular QoS queue from monopo- 
lizing the link. 20 

Alternative Embodiments 

While evaluation of QoS categories has been described 
above as occurring periodically, this evaluation may also be 25 
triggered by the occurrence of a predetermined event. 
Alternatively, evaluation of QoS categories may take place 
as part of the scheduling processing rather than as part of a 
separate periodic background process. 

While a relationship between the number of priority levels 30 
and the number of QoS queues has been suggested above, it 
is appreciated that the number of QoS queues may be 
determined independently of the number of priority levels. 
Further, it is appreciated that the number of QoS queues 
provided at each port may be fixed for every port or 35 
alternatively a variable number of QoS queues may be 
provided for each port. 

Finally, in alternative embodiments, weighting factors 
and ratios other than those suggested herein may be used to 
adjust the current bandwidth calculation for a particular 40 
implementation. 

In the foregoing specification, the invention has been 
described with reference to specific embodiments thereof. It 
will, however, be evident that various modifications and 
changes may be made thereto without departing from the 45 
broader spirit and scope of the invention. The specification 
and drawings are, accordingly, to be regarded in an illus- 
trative rather than a restrictive sense. 

What is claimed is: 

1. A method for bandwidth management in a packet 50 
forwarding device, comprising: 

identifying a quality of service (QoS) metric correspond- 
ing to a traffic group, the QoS metric defining a 
minimum QoS for the traffic group; 

receiving a data packet associated with the traffic group; 55 

placing the data packet into one of a plurality of queues; 

identifying a current measure of network performance 
with respect to parameters specified in the QoS metric; 

and 60 
removing the data packet from the queue if a difference 
between the current measure and the minimum QoS 
falls within a threshold. 

2. The method of claim 1 wherein identifying the QoS 
metric corresponding to a traffic group further comprises: 

identifying the traffic group through an Internet Protocol 
(IP) subnet membership identifier; and 
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determining a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

3. The method of claim 1 wherein identifying the QoS 
metric corresponding to a traffic group further comprises: 

identifying the traffic group through a media access 
control (MAC) address; and 

determining a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

4. The method of claim 1 wherein identifying the QoS 
metric corresponding to a traffic group further comprises: 

identifying the traffic group through a virtual local area 
network (VLAN) identifier; and 

determining a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

5. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a mini- 
mum bandwidth for the traffic group. 

6. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a maxi- 
mum sustained bandwidth for the traffic group. 

7. The method of claim 6 wherein identifying the QoS 
metric comprises receiving information indicating a peak 
bandwidth representing a bandwidth in excess of the maxi- 
mum sustained bandwidth that the traffic group can utilize. 

8. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a maxi- 
mum allowable delay for the traffic group. 

9. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a relative 
priority associated with the traffic group. 

10. The method of claim 1 wherein determining a current 
measure of network performance occurs at specified inter- 
vals of time. 

11. The method of claim 1 wherein determining a current 
measure of network performance with respect to parameters 
specified in the QoS metric comprises calculating the current 
measure for the parameters specified in the QoS metric. 

12. The method of claim 1 wherein receiving the data 
packet comprises receiving the data packet on a first port of 
a plurality of ports, and wherein removing the data packet 
from the queue comprises transmitting the data packet from 
a second port of the plurality of ports. 

. 13. The method of claim 1 wherein the packet forwarding 
device employs a non-deterministic access protocol, 

14. The method of claim 13 wherein the non- 
determninistic access protocol employed by the packet for- 
warding device is the Carrier Sense Multiple Access with 
Collision Detection (CSMA/CD) protocol. 

15. An article of manufacture comprising a machine 
accessible medium having content that when accessed pro- 
vides instructions to cause an electronic system to: 

identify a quality of service (QoS) metric corresponding 

to a traffic group, the QoS metric defining a minimum 

QoS for the traffic group; 
receive a data packet associated with the traffic group; 
place the data packet into one of a plurality of queues; 
identify a current measure of network performance with 

respect to parameters specified in the QoS metric; and 
remove the data packet from the queue if a difference 

between the current measure and the minimum QoS 

falls within a threshold. 

16. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
to identify the QoS metric corresponding to a traffic group 
further comprises the content to provide instructions to 
cause the electronic system to: 
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identify the traffic group through an Internet Protocol (IP) 
subnet membership identifier; and 

determine a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

17. The article of manufacture of claim 15 wherein the 5 
content to provide instructions to cause the electronic system 

to identify the QoS metric corresponding to a traffic group 
further comprises the content to provide instructions to 
cause the electronic system to: 

identify the traffic group through a media access control 10 

(MAC) address; and 
determine a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

18. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
to identify the QoS metric corresponding to a traffic group 
further comprises the content to provide instructions to 
cause the electronic system to: 

identify the traffic group through a virtual local area 2 o 
network (VLAN) identifier; and 

determine a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

19. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 25 
to identify the QoS metric comprises the content to provide 
instructions to cause the electronic system to receive infor- 
mation indicating a minimum bandwidth for the traffic 
group. 

20. The article of manufacture of claim 15 wherein the 30 
content to provide instructions to cause the electronic system 

to identify the QoS metric comprises the content to provide 
instructions to cause the electronic system to receive infor- 
mation indicating a maximum sustained bandwidth for the 
traffic group. 35 

21. The article of manufacture of claim 20 wherein the 
content to provide instructions to cause the electronic system 
to identify the QoS metric comprises the content to provide 
instructions to cause the electronic system to receive infor- 
mation indicating a peak bandwidth representing a band- 40 
width in excess of the maximum sustained bandwidth that 
the traffic group can utilize. 

22. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
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to identify the QoS metric comprises the content to provide 
instructions to cause the electronic system to receive infor- 
mation indicating a maximum allowable delay for the traffic 
group. 

23. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
to identify the QoS metric comprises the content to provide 
instructions to cause the electronic system to receive infor- 
mation indicating a relative priority associated with the 
traffic group. 

24. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
to determine a current measure of network performance 
comprises the content to provide instructions to cause the 
electronic system to determine the current measure at speci- 
fied intervals of time. 

25. Hie article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
to determine a current measure of network performance with 
respect to parameters specified in the QoS metric comprises 
the content to provide instructions to cause the electronic 
system to calculate the current measure for the parameters 
specified in the QoS metric. 

26. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
to receive the QoS metric comprises the content to provide 
instructions to cause the electronic system to receive the data 
packet on a first port of a plurality of ports, and wherein the 
content to provide instructions to cause the electronic system 
to remove the data packet from the queue comprises the 
content to provide instructions to cause the electronic system 
to transmit the data packet from a second port of the plurality 
of ports. 

27. The article of manufacture of claim 15 wherein the 
packet forwarding device employs a non-deterministic 
access protocol. 

28. The article of manufacture of claim 27 wherein the 
non-deterministic access protocol employed by the packet 
forwarding device is the Carrier Sense Multiple Access with 
Collision Detection (CSMA/CD) protocol. 

* * * * * 
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